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ABSTRACT 

We have measured the 2D 2-point correlation function, £ag> between low column 
density Ly-a absorbers and galaxies at a redshift z ~ 1. We measured Ly-a ab- 
sorbers between redshifts z = 0.68 — > 1.51 over a total redshift path length of 
Az = 1.08 from HST STIS E230M absorption spectra towards the quasars HE 1122- 
1648 (z = 2.4) and PKS 1127-145 (z = 1.187). The column density of the Ly-a ab- 
sorbers ranged from 13.2 ^logio (Nhi (cm -2 ))^ 17.4, with a median column density 
of logio (Nhi (cm _2 ))= 14.0. A total of 193 galaxy redshifts within the surrounding 
6.8' x 5.7' field of view of both quasars were identified in a R magnitude limited 
survey (21.5 ^ Rvcga ^ 24.5) using the FORS2 spectrograph at the VLT, of which 
95 were higher than the minimum redshift of z = 0.68 to be used in the correlation 
function. A 3a upper-limit of £ag= 2.8 was found when 145 Ly-a absorber-galaxy 
pairs were binned in redshift space, in a bin of size Act = 1.0, Att = 2.0 hfg 1 Mpc 
along the projected separation and line of sight distances respectively. 

The upper-limit in the cross-correlation was found to be 5.4ct lower than the central 
peak in the galaxy auto-correlation within the same redshift range, £gg , which was in 
our data equal to 10.7 ± 1.4. Thus we have shown for the first time that the clustering 
between low column density absorbers and galaxies at a redshift of 1 is weaker than 
that between galaxies at the same redshift. 

Key words: galaxies - intergalactic medium, galaxies - quasars: absorption lines, 
galaxies - galaxies: haloes. 



1 INTRODUCTION 

This paper is the third in a series that investigates the extent 
to which Ly-a absorbers in the inter-galactic medium (IGM) 
are associated with galaxies. 

At the current epoch only a minor fraction of the 



baryons in the universe exist within galaxies. Stars, galax- 
ies and other stellar remnants make up 6% of the baryonic 
matter density of the universe, ilb , while within the galax- 
ies cold (T < 10 4 K), c ondensed gas contributes a further 
~ 1.7% (|Bre"gmanll2007h . Therefore at least 90% of baryons 
in the universe lie beyond the galaxies, in the IGM. 
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Part of this remaining fraction, which is only 10 — 
100 times the mean cosmic density, is more susceptible 
to shock heating. Shock waves that are caused by grav- 
itational collapse travel along the filamentary structure 
heating the gas to T = 10 5 — 10 r K. These baryons 
form the diffuse warm - hot intergalactic med ium (WHIM) 
l|Cen fc Ostrikerl 1 19991 : iFukugita et~aH Il998l ). The detec- 
tion and stud y of the WHIM using metals detected 
in ultra-violet (IVerner et al.l 1 1994 iTripp fc Savage! l200d : 



rip _ 

iDanforth fc Shulll 120051: iThom fc Chenl 120081) and x-ray 
spectroscopy ( Nicastro et al.l l2002i : iFang et all l200rj ) is a 



major science motivation for the future space instruments 
XEUS (the X-Ray Evolving Universe Spectrometer) and the 
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new COS (Cosmic Origins Spectrograph) on the Hubble- 
Space Telescope (HST). 

A third of the gas that remains is probably located 
in the diffuse IGM at temperatures T ~ 10 4 - 10 s K 
llSembach et al.ll2004l ; iLehner et ai]|2007l ; iDanforth fc Shulll 
120081 ). The neutral fraction of this ionised plasma is ob- 
served in quasar absorption spectra. Under the influence of 
dark matter this gas is thought to have collapsed to form 
clouds that are part of a filamentary network that traces 
the layout of the galaxies. Two original opposing models 
questioned whether this gas i s correlated with ga laxies but is 
part of the giant cosmic web l|Morris et al.|[l99a ) , or whether 
the clouds are discret e and isolated with t he gas contained 
within galaxy haloes (|Lanzetta et alj[l995f ). 

Even though opinion is siding with the former there 
is probably no definitive answer to this question, as the 
typical location of these clouds is expected to be a func- 
tion of their column density. At logio (Nhi (cm _2 ))~ 2f, 
the density of gas in an absorber would be comparable 
to that in the outer regions of a galaxy. Therefore at a 
small separation it would be pointless to draw a line where 



|Rao et al.ll20q4 Zwaan et al.l 2005 


) and intermediate red 


shift universe jLe Brun et al. 1997: 


Chen & Lanzettall2003l 



clouds with a column density logio (Nhi (cm _2 ))> 20.3, 
(the damped Ly-a absorbers systems) have been been shown 
in imaging and spectroscopy to be associated with galax- 
ies. Meanwhile those clouds with a column density several 
decades less dense, logio (Nhi (cm _2 ))^C 14, are more likely 
to make up the filaments or lie in the voids of the IGM. 
The absorber-galaxy association is also dependent on the 
epoch, for example because of the decrease i n the number 
density of Ly-q lines since redshift z ~ 1 .7 |Sargent et all 
1980l: lBahca"il et al ] |l99ll : IJanknecht et al.ll200fj |. 



1.1 The Issues to Resolve and Investigations so 
far 

The 2D 2-point cross correlation between Ly-a absorber and 
galaxy positions is a powerful way to quantitatively measure 
the relationship between galaxies and the IGM. It is hoped 
that these measurements will provide an insight into the 
degree of association between galaxies and the surround- 
ing neutral gas. The degree of correlation is thought to de- 
pend on the redshift, the column density of the absorbing 
cloud, the galaxy magnitude, the presence of a galactic wind, 
and the line-of-sight distance and projected separation (dis- 
tances 7T and a) between the galaxy and the absorber. 

A common way to analyse the Ly-a absorber-galaxy 
cross-correlation is to contrast the results with the galaxy 
auto-correlation. If these values are comparable or the cross- 
correlation is greater at small separation, then the Ly-a ab- 
sorbers could be claimed to be more of a feature of the galac- 
tic halo rather than the IGM. 

We now give a short overview of such measurements 
that have been made over different redshifts. 



1.1.1 Observations at Low Redshift 

iRvan-Webel J200fj ) (hereafter RW06) studied the Hl-galaxy 
correlation at redshifts z ~ 0. 5317 gas rich galaxies that 



were detected in the HIPASS galaxy survey (I Zwaan et al.l 
l2003h were correlated with 129 absorbers from 27 lines of 
sight. These spectra were gathered from STIS and GHRS 
data and all absorbers had logio (Nhi (cm" ' 2 ))= 12.41 - 
14.81. 

It was found that at small separations there is a strong 
correlation between absorbers and gas rich galaxies that 
is similar in strength to the galaxy auto-correlation func- 
tion. This supports the idea of matter clustered within the 
vicinity of galaxies (within the nodes of the cosmic web) be 
they absorbing clouds or other galaxies. However, unlike the 
galaxy auto-correlation that decays radially along the a and 
7r direction, £ag remained high out to 10 h _1 Mpc or 1000 
km s _1 along the line of sight. The proposed origin of this 
was the accretion of gas. In this scenario the gas falls in from 
the more diffuse IGM, drains along filaments and falls into 
a galactic halo causing an elongation in the redshift space 
correlation along the line of sight. The gas rich galaxies that 
were detected using 21 cm radiation also tended to have a 
low mass (M ~ 10 s Mq ) , and it is galaxies such as these that 
iKeres et all (2005) claims are dominated by this method of 
accretion. 

1.1.2 A simulation o/£ag at low redshift 

The results of RW06 were compared with simulations in 
IPierleoni et al.l ()2008l ). Three simulations, each with a dif- 
ferent galaxy wind model were run to a redshift of zero. 
Either no winds, strong winds or extremely strong winds 
were present. 

Spectra were generated from 999 sightlines that sam- 
pled the simulation. The Ly-a absorbers that had a column 
density in the range logio (Nhi ( cm_2 )) = 12.41 — 14.81 were 
then correlated with ~ 5000 galaxies that resided in haloes of 
massM = 8xiq 10 -1 13 - 5 M©. However, contrary to RW06, 
IPierleoni et al.l (|2008T ) found the galaxy auto-correlation was 
predicted to be stronger than the absorber-galaxy cross- 
correlation. 

The correlation function was recalculated using only 
27 sightlines to match the sample from RW06, and the 
sample noise increased. Bootstrap errors showed a differ- 
ence between £ag an d £gg at only a l<r significance level. 
IPierleoni et al.l (|2008T l suggest that it is this sparse sampling 
that make the cross and auto-correlation functions indis- 
tinguishable. They also attribute the large "finger-of-god" 
observed in Figure 3 of RW06 to this effect. 

1.1.3 Observations at low to intermediate redshift 
(i) \ Chen & MulchaeA \200A ): \Chen et al\ \200A) 



IChen et all (|2005T ) studied the sightline towards PKS 
0405-123, and compared the positions of 112 Ly-a ab- 
sorbers from STIS echelle spectra with 482 galaxies within 
the redshift range z = 0.1 — > 0.5. These were collected using 

a magnitude limited survey down to R ^ 20. 

The most striking result found by I Chen et alj l|2005h was 
a difference in the level of correlation that depended on 
whether the galaxy in question was dominated by emis- 
sion or absorption spectral lines. The correlation function 
between galaxies that were dominated by emission, and Ly- 
a absorbers that had logio (Nhi (cm -2 ));? 14.0, was found 
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to be similar to the galaxy auto-correlation. No significant 
cross-correlation was seen for those galaxies dominated by 
a bsorption. 

IChen fc Mulchaevl (|2009h then expanded on this result 
with two further lines of sight towards quasars HE 0226- 
4110 and PG 1216+069. It was found that within a 
projected separation of 1 h^" 1 , Mpc Ly-a absorbers with 
logio (Nhi (cm~ 2 ))^ 14 had a similar cross-correlation with 
galaxies as the auto-correlation of emission line dominated 
galaxies. This was ~ 6 times weaker than the self-clustering 
of absorption line dominated galaxies. They suggest that the 
cross-correlation was similar between emission dominated 
galaxies and Ly-a absorbers because both these occupy the 
same halo system. 

The cross-correlation between any type of galaxy and 
Ly-a absorber with logio (Nhi (cm -2 ))^ 13.5 was observed 
to be very weak. 

(ii) Papers I and II 

This paper is an e xtension to the investig ation first 
conducted in paper I, iMorris fc Jannuzll l|2006l ) (hereafter 
MJ06). Here the location of 636 galaxies, with redshifts 
determined using CFHT spectroscopy, were compared with 
381 Ly-a absorbers found in the UV spectra of quasars 
taken with the Faint Object Spectrograph during the 
Hubble Key Project. It was found that, excluding the 
high column density systems (logio (Nhi (cm -2 )) > 17), 
absorbers are correlated with galaxies at redshifts of z ^ 0.8 
and that this correlation is weaker than the corresponding 
galaxy auto -correlation. 

Paper II l|Wilman et al.ll2007l ) (hereafter W07) using the 
MJ06 dataset computed the 2-point correlation function 
along the line-of-sight (n) and projected separation (a). A 
conclusion drawn from this work suggested that at small 
co-moving separations of a < 0.4, ir < 2 h _1 Mpc, the 
correlation function increases marginally as the column 
density increases from logio (Nhi (cm _2 )) = 13 — 16, then 
there is suggestion of a sharp increase in correlation above 
logio (Nhi (cm~ 2 ))^ 17. This was proposed as tentative ev- 
idence for the column density at which absorbers become a 
part of the galactic halo. 

It was argued in W07 that a "finger-of-god" may also 
be caused by a large anisotropy, for example a quasar line 
of sight that runs parall e l and close to to a large filament. 
However IPierleoni et al.l (|2008l ') ruled out a "finger-of-god" 
being caused by this geometric effect in their simulation by 
showing that the £ag result remained the same no matter 
which orientation the sightlines had been directed. Instead 
a "finger-of-god" was caused if a sightline passed through a 
virialised system of gas and a paired galaxy. 

1.1.4 Observations at high redshift 

In order to observe the IGM-galaxy relationship at high red- 
shift (where the Ly-a absorption has now reached optical 
wavelengths) lAdelberger et all (|2003T ) correlated the Ly-a 
lines from 8 quasar HIRES or ESI spectra with 431 Lyman- 
break-galaxies. 

It was expected that greater concentrations of gas would 
be located near to the galaxies. This was found to be so as 
gas with a greater optical depth than the mean density was 



found to lie within ~ 10 h _1 Mpc of the nearest galaxy. 
However, evidence arose of an anti-correlation within ~ 0.5 
h _1 Mpc. It was hypothesised that this was caused by super - 
winds that, travelling at ~ 600 km s _1 (|Pettini et al.ll200lh . 
clear the immediate vicinity of the galaxies and heat the 
surroundings, decreasing the neutral fraction of HI and so 
increasing the transmission of QSO flux. 

With a la rger data set of 23 sightl ines and 1044 UV se- 
lected galaxies lAdelberger et all (|2005l ) found that evidence 
for an anti-correlation at small separation significantly de- 
creased. Only in a third of the cases was weaker absorption 
now detected around galaxies. 

1.2 The motivation for this work 

No results have been obtained for the redshift window z ~ 
1. The main reason for this is the difficulty of acquiring a 
sufficient number of galaxy redshifts at this epoch that are 
within the same fields of view as quasars for which there is 
high resolution UV echelle spectral data. The aim of this 
investigation was to start to bridge this gap. 

Thus we hope to shed light on the Ly-a absorber-galaxy 
correlation at redshifts z ~ 1. Then by comparison with the 
other works mentioned above determine whether the corre- 
lation evolves with redshift. 

The structure of this paper will be as follows. The data 
reduction for the FORS2 galaxy survey and absorbers from 
the HST STIS E230M spectra will be described in Sections 2 
and 3 respectively. In Section 4 the basics of the 2D 2-point 
cross correlation function used will be outlined. Results are 
given in Section 5 and we present our discussion and con- 
clusions in Section 6. 

Throughout the paper we have adopted the cosmo- 
logical parameters Q,m = 0.3, Q,a = 0.7, I170 = 0.7 and 
H = 100h 70 km s _1 Mpc" 1 . 



2 THE GALAXY SURVEY 

Galaxy surveys in two 6.8' x 5.7' regions around the quasars 
HE 1122-1648 (z = 2.40) and PKS 1127 -145 (z = 1.187) 
jCarswell et al.ll2002l ; iBechtold et al.ll2002r ) were carried out 
using the FOR S2 spectrograph at the VLT on the 20-22 
February 2006 l|Appenzeller et all [199^ 1. The total wave- 
length range of the absorbers to be used in the archived 
HST STIS data was between 2050-3100 A. This means we 
could potentially detect Lya absorbers between the redshifts 
of 0.68 and 1.55. It was our aim to observe galaxies within a 
similar redshift window. This was done by selecting galaxies 
based on a magnitude limit (21.5 ^ niR 24.5) and using 
FORS2 to detect [O II] emission appearing in the 5600-11000 
A bandpass. It is thought these [O II] emitters, indicative of 
star-forming galaxies, with their systems of feedback, out- 
flows and photoionisation are responsible for most of the 
complexity in the gas-galaxy relationship at z~ 1. 

2.1 Pre-imaging and mask generation 

Prior to the Multi-Object-Spectroscopy (MOS) observing 
run pre-imaging was taken in service mode of the two quasar 
fields using the Bessel RJ3PECIAL+76 filter. Photometric 
conditions with good "seeing" (^ 0.8) was requested for each 
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Table 1. The quasar field coordinates 



Quasar 



RA range 
(J2000) 



DEC range 
(J2000) 



PKS 1127-145 
HE 1122-1648 



11 30 21.57 : 11 29 52.50 
11 24 57.47 : 11 24 28.07 



-14 53 34.84 : -14 45 
-17 09 25.71 : -17 01 



54.81 
45.68J 



10 minute exposure in order to attain accurate astrometry 
and photometry when selecting faint (R ^ 24.5) galaxies. 

Table 1 shows the areas of sky that were included. 

These images were reduced using the FORS2 imaging 
pipeline at ESO headquarters in Garching as part of the 
routine service mode. Calibration data had also been taken 
at the same time, including bias frames, sky flats and photo- 
metric standard images. Using this calibration data the raw 
images were bias subtracted, flat-fielded and cosmic rays 
were removed. 

We performed the final stages needed to reduce the sci- 
ence images locally using the standard IMUTIL tasks within 
irafQ The overscan region of each frame was cropped and 
then images taken from chips 1 and 2 were joined together 
using IMTILE. 

The softw a re Source Extractor (v2.4) 

|Bertin fc Arnoutsl Il996l ) was then run in order to 
compile a catalogue of all the possible targets for follow-up 
spectroscopy. 

SExtractor was used to calculate the photometry of ev- 
ery target and classify each target according to whether it is 
a star or galaxy. The relevant parameters were MAG AUTO, 
an estimate of the apparent magnitude enclosed within an 
ellipse that encompasses 90% of the the flux, and CLASS 
STAR. 

The magnitude zero point was determined by measuring 
the total counts of exposed standard stars within the image 
foreground and comparing these flux values with that of 
their magnitude in the USNO-A2.0 star catalogue. 

Figure[T]shows a plot of CLASS STAR against apparent 
magnitude (uir) for all the 2666 objects extracted from the 
field of HE 1122-1648. One can tell that the reliability of 
CLASS STAR decreased as the targets become fainter. A cut 
of CLASS STAR ^ 0.8 was mad e to select galaxy candidates 
l|Crist6bal-Hornillos et ai1l2003l ). 

This plot also gave an indication as to what would be 
an effective faint magnitude cut-off. To avoid those tar- 
gets where CLASS STAR was ambiguous a faint cut of 
ran ^ 24.5 was chosen. The minimum brightness was also set 
by what could be successfully observed within the allotted 
exposure time. To remove those foreground galaxies likely 
to have redshifts too low to correlate with the HI absorbers, 
a bright cut-off of m^ > 21.5 was employed. 



2.1.1 Making the masks 

The final catalogues consisted of 1062 and 1122 objects in 
the fields of HE 1122-1648 and PKS 1127-145 respectively. 



1 IRAF is distributed by the National Optical Astronomy Ob- 
servatory, which is operated by the Association of Universities 
for Research in Astronomy (AURA) under cooperative agreement 
with the National Science Foundation. 




20 22 24 

Apparent Magnitude (R) 



Figure 1. This horseshoe-shaped plot of the CLASS STAR classi- 
fication against magnitude allowed cuts to be made so that galax- 
ies within the enclosed rectangle could be included in the galaxy 
catalogue to be sent to the mask making software. Those selected 
had 21.5 ^ m R ^ 24.5 and ^ CLASSSTAR sC 0.8. 



The lists were then passed to the FIMS software for the 
FORS2 instrument that designs custom masks for the mask- 
exchange-unit (MXU). 

We designed rectangular slits 1" by 8". This meant that 
all targets took up no more than 1 / '4 of the length of the slit, 
allowing ample space for sampling the sky so that it could 
be subtracted later. The slit sizes and the galaxy density 
meant that up to 50 slits could be made per mask. 

When investigating 2D 2-point correlation functions it 
is important that the slits are assigned randomly so as not 
to bias a particular absorber-galaxy/galaxy-galaxy angular 
separation. Therefore a random slit allocation was used in 
FIMS to ensure a random distribution of galaxies was se- 
lected when designing each mask. 

It was also important to obtain more than one mask per 
field-of-view to allow sampling of close pairs. 

The catalogue was split between those galaxies with 
21.5 ^ m_R ^ 23.5 and 23.5 ^m_R ^ 24.5 so that sepa- 
rate "bright" and "faint" masks would be used. They could 
then be exposed for different periods of time. This allowed 
us to optimise the exposure times used to reach a given 
signal-to-noise ratio for each mask - shorter exposure times 
for bright masks, and longer times for faint masks. Alto- 
gether 22 masks were designed with each mask containing 
approximately 40 slits. There were 11 for each field-of-view, 
including 5 that were assigned bright objects and 6 that had 
slits over galaxies with 23.5 ^hir ^ 24.5. 



2.2 Details of the observing run 

The FORS2 MXU was used in visitor mode with the 
R^SPECIAL+76 filter and the 2001+28 grism. This grism 
gave a central wavelength of 7450 A with a spectral range of 
5600 -S> 11000 A . Th e dispersion was 2.43 A pixel -1 with a 
Jehin fc O'Brienll2007l ). 



resolution — 380 



The plan for the first night of observation (MJD 53787) 
was to concentrate on PKS 1127-145 with 5 "faint" masks 
and 5 "bright". Very poor 'seeing' conditions (~ 1.4") and 
high winds hampered the first night of observing and it was 
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Table 2. The galactic spectroscopy that was carried out 



Mask 


No. of Exposures 


Mean 


Comments 


name 


and time (s) 


'Seeing' 




1127 "Bright" 1 


3 x 650 


1.7" 




1127 "Faint" 1 


2 x 1650 


1.4" 


No continuum 


1127 "Faint" 2 


2 x 1650 


1.0" 


Few sources 


1127 "Bright" 2 


3 x 650 


1.0" 




1127 "Bright" 3 


3 x 650 


0.80" 




1127 "Bright" 4 


3 x 650 


0.80" 




1127 "Bright" 5 


1 x 650 


1.7" 




1122 "Bright" 1 


2 x 1750 


1.0" 


TOO 


1122 "Bright" 2 


2 x 1750 


0.75" 




1127 "Bright" 1 


2 x 1750 


0.65" 




1127 "Bright" 2 


1 x 1750 


0.50" 




1127 "Bright" 3 


1 x 1750 


0.65" 




1127 "Bright" 4 


1 x 1750 


0.65" 




1127 "Bright" 5 


2 x 1350 


0.65" 




1122 "Bright" 3 


2 x 1350 


0.65" 




1122 "Bright" 4 


2 x 1200 


0.70" 




1122 "Bright" 1 


2 x 1750 


0.65" 





found that "faint" masks could not be observed in these 
conditions. 

For the second night (MJD 53788) we concentrated 
on the "bright" masks only from both fields-of-view. The 
weather improved significantly together with the 'seeing' 
which remained l". During the first part of the second 
night there was a target-of-opportunity (TOO) that inter- 
rupted the first observation. Therefore the 1122 "Bright" 1 
mask was observed again in service-mode. Two exposures of 
1750 seconds each were carried out in good conditions for 
each bright mask. The data we were able to observe over 
three nights is summarised in Table 2. 



Kelson software was then sent to the APALL programme in 
IRAF which performed a variance weighted extraction. 

The output of APALL for each slit were 4 1-dimensional 
arrays: The weighted object spectrum, the unweighted ob- 
ject spectrum, the subtracted sky background and a variance 
spectrum, all as a function of wavelength. 

Spectra were then flux calibrated using the IRAF tasks 
STANDARD and SENSFUNC. The standard star data was 
taken by ESO on the night of the observations. 



2.3.1 Identification of emission lines 

The plotting package SPLOT was used to identify emission 
lines by eye in the 1-D spectra. 

Upon successful identification of any spectral features 
the galaxy was sent to the NOAO programme RVIDLINES. 
Each emission line was fitted with a Gaussian profile that 
was centred on the emission peak. The FWHM of the profile 
was set to a default 4 pixels (9.72 A), with a 5 pixel minimum 
gap between 2 features. 

A variance weighted mean redshift was then computed 
using all of the features identified. The heliocentric correc- 
tion was made to the redshift using the header information. 
An RMS error in the redshift was also calculated using the 
redshift of each feature zf ea ture and z in equation |T}. 



Xvi — 1 (^feature ^) 



(1) 



Figure shows a typical spectrum of a galaxy where 
only [O II] emission was used to determine the redshift. H/3 
and [O III] lines were lost in the noise. The '4000 Angstrom 
break' in the SED helped to identify this line as [O II] emis- 
sion and not Ha. 

Using this method the redshifts of 180 emission-line 
dominated galaxies were found. 



2.3 Data reduction of the FOR.S2 data set 

At the time of reduction no ESO pipeline existed for the 
automatic reduction of FORS2 MXU data, so the extraction 
of galaxy spectra was mainly performed using IRAF with 
predominant use of the IMUTIL and NOAO packages. 

The science frames were stacked to remove contamina- 
tion by cosmic rays. The majority of these were removed 
using COSMICRAYS from the CRUTIL package. Signifi- 
cant cosmic rays persisted in regions with strong emission 
features. Any spurious feature that made it to the final 1-D 
extraction could later be identified and removed by interpo- 
lation. 

Each science exposure together with a flat and an arc 
frame were then sent to a ge neric spectro scopy reduction 
pipeline written by D. Kelson (|Kelsonll200"3h Fl 

This software divided the science images containing the 
2D spectra by a master fiat. The 2D spectra were then rec- 
tified to correct for any distortion along the spatial and dis- 
persion axes. Using a He-Ar arc frame the programme found 
the dispersion relation and wavelength calibrated the 2D 
spectra. The calibrated 2-D science frame output from the 

2 D. Kelson, The Observatories, Carnegie Institute of Washing- 
ton, 813 Santa Barbara Street Pasedena, California 91101 USA. 



2.3.2 Identification of absorption lines 

Poor signal-to-noise (S/N) meant identifying the redshifts 
of absorption-dominated galaxies was not possible by in- 
spection. Instead the IRAF programme XCSAO was used 
ijKurtz fc M ink 1998). Absorption features were identified 
by cross-correlating with a template absorption spectrum. 
These absorption lines were then used in RVIDLINES to 
measure redshifts for the 13 absorption- line dominated 
galaxies that were included in the final table. 

2.3.3 Redshift Confidence of the Galaxies 

Each of the 193 galaxies that made up the final table were 
then assigned a redshift confidence level based on the num- 
ber and strength of any features. The criteria for each level 
are shown in Table [3] 

Those galaxies with a ranking of 1 were most often 
AGN. The galaxies dominated by absorption lines were as- 
signed a ranking of 2 or 3 depending on how visible the lines 
were after being identified in XCSAO. 61 confident redshifts 
were found, even though only [O II] emission was visible 
because of features such as the '4000 Angstrom break'. 

The 28 [O II] emission lines for which there was no 
alternative feature could have been Ha. This meant there 



6 Shone, Morris, Crighton & Wilman 



Variance weighted Flu 
a spectrum • 
Sky subtraction 




7000 8000 
Wavelength-A 



Figure 2. A variance weighted combined spectrum of slit 36 from 
"bright" mask 1 of sightline HE 1122-1648. The redshift of this 
emission line dominated galaxy was determined using the [O II] 
emission line only. The H and K absorption lines were located 
near a cosmic ray which has been interpolated over at ~ 7500 
A . The feature at ~ 7600 A is atmospheric O2 absorption. For 
display purposes the spectrum has also been box-car smoothed 
with a window of 5 pixels. The residuals at > 9000A are a product 
of poor sky-subtraction and so the H/3 and [O III] doublet could 
not be identified. The increase in flux above 7600 A meant the 
emission line could be identified as [O II] rather than Ha because 
of the '4000 Angstrom break'. The cr-spectrum (dashed) and sky 
value (dot-dashes ) that was subtracted are also shown. The value 
of the sky spectrum has been divided by 100. 



Table 3. Confidence levels and criteria needed for the galactic 
rcdshifts 



Confidence 


Comment 


No. of 


Level 




Galaxies 


1 


2 or more strong lines 


54 


2 


2 or more moderate lines 


26 


3 


2 or more weak lines 


18 


1 


Secure single line [O II] redshift 


61 


5 


2 or more weak lines from poor spectra 


6 


6 


Ambiguous single line [O II] redshift 


28 



were a total of 34 spectra (those ranked 5 or 6) for which 
there were ambiguous results. Figure [3] shows the redshift 
distribution of the 193 galaxies binned according to whether 
the redshift was judged to be reliable. 

Those galaxies with an insecure redshift were flagged 
when passed to the code and removed from the calculation. 
The 61 galaxies only showing confirmed [O II] emission were 
also flagged in order that they too could be removed in case 
they significantly changed the result of the correlation. 

Being restricted to effectively a single night's observa- 
tion meant we did not achieve a large completeness in the 
galaxy survey. Figure|4]shows the percentage of galaxies with 
a determined redshift of the 1062 and 1122 objects found us- 
ing SExtractor. They have been plotted as a function of their 
separation from the quasar sightline. This incompleteness is 
accounted for in the calculation of the correlation function. 



Rank 1,2,3 
Rank 4 
Rank 5,6 



0.6 0.8 1.0 

Redshift 

nsize = 0.1 Median redshift = 0.80 



Figure 3. The redshift distribution of the 193 galaxies is shown 
in this figure, binned as a function of the redshift confidence. This 
plot shows that most of the galaxies in the galaxy sample used in 
the correlation were obtained from single line [O II] emission and 
had a redshift z^0.8. The dashed-dotted line at z = 0.68 shows 
the minimum redshift at which galaxies could correlate with Ly-a 
absorbers found in STIS E230M spectra. 



Survey completeness as a function of separation from the QSO 



HE 1122-1648 Completeness 
PKS 1127-145 Completeness 



100 200 3C 

Angular Separation from QSO - ' 
Binsize = 30.0' 

Figure 4. Of the 1062 and 1122 candidate galaxies detected in 
the fields of quasars HE 1122-1648 and PKS 1127-145 this plot 
shows the percentage of those with a determined redshift. The 
completeness is calculated for the apparent magnitude ranges 
21.5 ^ m R ^ 23.0 and 21.5 ^ m R ^ 24.5. 



2.4 The Luminosity Distance and K-Correction 

Using the selected ACDM cosmology stated in Section 1 the 
luminosity distance to each galaxy (Dl) was found and the 
absolute magnitude Mb of each galaxy was computed. 



M B = m R - 5 



Dl(z) 
10 pc 



+ K C 



(2) 



m_R took the value of MAG AUTO that was computed 
when SExtractor was run on the pre-imaging. As the pre- 
imaging was only exposed through the red Bessel filter a 
theoretical K-correction had to be computed using the re- 
sponse curves of two filters and the spectral energy distri- 
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Table 4. Spectra from the ESO and HST data archives 
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Figure 5. The mean absolute magn i tude Mg of the 193 galaxies 
based on the 5 iBruzual fc Charloj i2003h SED templates. The 
error bar is equal to the largest range in the possible values of 
Mg from all of the galaxies. The vertical line shows the cut-off 
at z = 0.68 above which the galaxies were at an appropriate 
rcdshift to correlate with the Ly-a absorbers and for use in the 
auto-correlation. 



butions (SEDs) of template galaxies. Details of the code 
used for the colour correction and th e template SEDs are 
described in lBruzual fc Chariot! (|2003h . 

To compute the B — R K-correction for each spectral 
type the theoretical flux that would be transmitted through 
the rest-frame B filter was computed. Each SED was then 
transformed to a higher redshift in steps of Az = 0.005 and 
the flux through the red filter was integrated. The output 
from the code was a table, with a value for the K-correction 
listed as a function of redshift and spectral type. Galactic 
evolution was not taken into account when transforming the 
SEDs. With no clear indication from the galaxy spectra as 
to the spectral type of each galaxy the B — R colours for 
all 5 templates were considered. Figure [5] shows the mean 
absolute magnitude of all 193 galaxies based on the 5 SED 
templates. From Figure [5] it is apparent that we were calcu- 
lating the cross-correlation and galaxy auto-correlation us- 
ing galaxies that had — 17 > Mb > — 21. We did not have a 
sufficient number of galaxies to investigate the dependence 
of the correlation on the galaxy magnitudes. 

Altogether the spectroscopic redshifts of 193 galaxies 
were found and were added to a master catalogue. This in- 
cluded 37 in the field of HE 1122-1648 that were at a redshift 
of z ^ 0.68 and had a redshift confidence ^ 4, and so could 
be correlated with the Ly-a absorbers. The number of galax- 
ies that could be used in the calculation that were found in 
the field of PKS 1127-145 was 58. 
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ists within the HST archive. These archived spectra cover 
the wavelength range 2050-3111 A , the range of wavelengths 
needed to locate Ly-a lines at a redshift of z ~ 1. With such 
a high dispersion, it was hoped that low column density 
(logio (Nhi (cm~ 2 ))~ 13) HI absorption lines could be re- 
solved and Voigt profiles fitted in order that their column 
density and b-parameter could be determined. 

The absorber sample in this paper has a higher resolu- 
tion than the absorber data set of lMorris fc Jannuzil (2006). 
They used HST FOS data and a fitted b-parameter that was 
fixed at 30 km s" 1 . The STIS E230M instrumental broaden- 
ing was 10 km s _1 in the spectra we used, compared to 230 
km s _1 for FOS spectra. Therefore in pr inciple we are sen- 
sitive to lower column density lines than lMorris fc Jannuzil 
(2006), and we can directly measure the b-parameters for 
absorbers, rather than assuming a value of 30 km s _1 . 

The only Ly-a absorbers that were used in the cor- 
relation function were detected using the Space- Telescope- 
Imaging-Spectrograph (STIS) in echelle mode with the 
E230M grating. In addition to these, to help identify lines 
and to compile a more complete line- list, spectra were anal- 
ysed that were taken by STIS in long-slit mode, the low- 
resolution Faint Object Spectrograph (FOS) and the ESO 
UVES (Ultraviolet and Visual Echelle Spectrograph) instru- 
ment. 

Table 4 lists the data products used in the investiga- 
tion that were collected from the STScI (Space Telescope 
Science Institu te) and ESO (European Southern Observa- 
tory) archives (|Keves et"aL 1 1 19951 ; IKim Quiiano et al.ll2007l ; 
iKaufer et al.ll2007l ) 



3 QUASAR UV SPECTROSCOPY 

The Ly-a absorbers used in the cross-correlation function 
were detected in the absorption spectra of the background 
quasars HE 1122-1648 and PKS 1127-145. These quasars 
were selected because high resolution (R= 30000), STIS 
echelle spectroscopy obtained using the E230M grating ex- 



3.1 Reduction of both long-slit and echelle spectra 

The preliminary calibration and reduction of data for each 
instrument was performed by STScI or ESO 'on-the-fly' by 
an instrument specific pipeline. The steps performed in the 
initial reduction of raw data depended on the detector. 
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3.1.1 FOS calibration 

The details for the FOS pipeline CALFOS are from 
iKeves et al l (|l995h and they differed significantly from both 
the STIS and UVES pipelines as the data was already 1- 
dimensional. The pipeline began by computing the statis- 
tical error in the raw data, this was then carried forward 
through each reduction step and used in the variance spec- 
trum. The total photon count was divided by the exposure 
time to give the count rate. Next the background sources of 
noise were subtracted. This included the dark current gen- 
erated by the instrument and scattered light from the op- 
tics. The variations in sensitivity with A from diode-to-diode 
and the illumination correction were then compensated for 
by dividing each pixel by the flat-field response. After each 
observation FOS would perform an arc exposure using the 
internal Pt-Cr-Ne lamp in order to find the wavelength zero- 
point. Vacuum wavelengths were used and the pipeline, us- 
ing the date of observation, performed a heliocentric velocity 
correction. Finally using the data from a spectrophotometric 
star the pipeline flux calibrated the object spectrum convert- 
ing the count rate to ergs s _1 cm -2 A , thereby correcting 
for the telescope and spectrograph transmission and the de- 
tector efficiency. 

3.1.2 STIS calibration 

The initial stage of the pipeline CALSTIS varied depending 
on whether the data was taken using the MAMA or CCD de- 
tector. A detailed descript ion of both pipelines is covered in 
iKim Quiiano et al.l (|2007T ). Similar to the FOS pipeline the 
first process was to calculate a statistical error, this error 
was then propagated through the reduction pipeline. The 
dark current was then subtracted and the 2D spectra were 
divided by a flat field exposure created by a lamp within 
the spectrograph. The STIS CCD data was reduced in the 
same manner as generic CCD optical data. For both UV 
and optical data the STIS pipeline then used an arc expo- 
sure to calculate the wavelength dispersion solution with the 
necessary heliocentric velocity correction. Using a spectro- 
photometric star the spectra were also flux calibrated to give 
counts in ergs s -1 cm" 2 A . The 2D long-slit and echelle 
data were then binned along the spatial axis to produce ID 
spectra. 

3.1.3 UVES Calibration 

The pipeline steps for the UVES raw data followed the same 
procedure as the STIS optical data save for two main dif- 
ferences. As UVES is a ground based instrument the sky 
needed to be subtracted when extracting the ID spectra. 

When we analysed the absorption lines of the UVES 
spectra we used vacuum wavelengths so that they would cor- 
respond to the absorption lines of the STIS data. Therefore 
the dispersion solution needed to undergo an air-to-vacuum 
wavelength correction before transforming to the heliocen- 
tric reference frame. 

3.1.4 Combining multiple exposures 

We performed the final steps in reducing the ID spectra. We 
combined exposures using variance weighting and fitting a 



QSO continuum. A different method was used depending 
on whether the spectrum was long-slit data or from echelle 
spectroscopy. 

The nature of digicon, MAMA and CCD detectors 
means that, prior to any combining of exposures, each of 
the spectra had a wavelength relation that was not the same 
or constant as AA oc A. So the first step before combining 
multiple exposures was to inspect their dispersion and then 
re-sample each of the spectra to a common wavelength so- 
lution. 

3.1.5 Fitting the QSO continuum 

When analysing absorption lines the QSO flux needed to be 
divided by the QSO continuum. 

The continua for the long-slit spectra were fit using the 
IRAF programme SPLOT. The first step when estimating a 
continuum was to crop the spectrum by ~ 100 A at both the 
red and blue ends. This was to remove data with a S/N per 
pixel ^ 2. Any emission features or large absorption profiles 
were also clipped. In particular this included the Ly-a and 
Ly-/3 emission lines if they were present. The continuum of 
a QSO spectrum was fitted by sampling the spectrum at 
points devoid of absorption, usually in regions redwards of 
the Ly-a emission, this was then extrapolated to lower wave- 
lengths. A chebyshev function of order 2 was fit to the data 
using SPLOT. The final spectra were then saved in a FITS 
format for use in RDGEN, a programme that would nor- 
malise the spectra by dividing through by the continuum 
and detect absorption features. RDGEN is part of the VP- 
FIlB package that was used when modelling Voigt profiles. 

3.2 Post calibration STIS E230M and UVES 
Echelle spectroscopy reduction 

The echelle data was composed of many exposures, these 
not only needed to be re-sampled and combined, but also 
the separate overlapping orders needed to be joined end- 
to-end while simultaneously avoiding aliasing or losing flux 
when re-sampling. For this reason a C progr am called UVES 
popler (version 0.17) (jMurphv et al.l 120071 ) was used. This 
was devised and written by M. T. Murphy □ with the inten- 
tion of reducing post-pipeline UVES echelle data. The input 
for UVES popler were all the optimally extracted echelle or- 
ders from the STScI CALSTIS and UVES pipeline and the 
associated variance spectra. Combining these orders, resam- 
pling the data, setting a common dispersion solution and 
normalising by the QSO continuum was then an automated 
process. 

Parameters were passed to UVES popler that gave the 
function and order to fit for the continuum. A chebyshev 
function of order 2 was selected above and below the Ly-a 
emission line. High column density absorption lines were ig- 
nored by UVES popler by rejecting any data that was in the 
lower 50% of the flux values below the Ly-a emission line. 

3 ©2007 R.F.Carswell, J.K. Webb, M.J. Irwin, A.J. Cooke, In- 
stitute of Astronomy, Madingley Rd, Cambridge, CB3 0HA,Uk. 

4 M. Murphy, Centre for Astrophysics & Supercomputing, Swin- 
burne University of Technology, Mail 39, PO Box 218, Hawthorn, 
Victoria 3122, Australia 
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Flux values that made up the lower 30% of the data above 
Ly-a were also ignored. A higher tolerance was permitted at 
these wavelengths as there were only a few metal absorption 
lines and no forest absorption. 

The level to set the continuum was then iteratively im- 
proved by neglecting data that remained 1.2 a below a pro- 
visional continuum line and any emission/noise features that 
existed 3cr above. We visually inspected the final continuum, 
adjusting it by hand where necessary. Errors in the contin- 
uum level may introduce small errors in the inferred column 
densities of lines for regions of our spectrum with low S/N 
3), but we do not expect these errors to affect our mea- 
surement of the line redshifts. 

Both HE 1122-1648 and PKS 1127-145 were fit with the 
same parameters for the STIS and UVES spectra. 

3.3 Producing a Quasar Absorption line-list 

Locating the Ly-a absorbers that would be used in the cross- 
correlation function at a redshift z ~ 1 required the STIS 
E230M data. The other spectra were used for completeness, 
for the identification of metals and to corroborate the iden- 
tity of STIS Ly-a absorption lines. 

The Ly-a and Ly-/3 emission lines of quasar PKS 1127- 
145 are located at 2658.67 A and 2243.25 A. Therefore only 
the Ly-a series lines were within the wavelength range of the 
E230M data. Ly-a, Ly-/3 and Ly-7 emission lines for quasar 
HE 1122-1648 were at 4133.28 A, 3487.46 A and 3229.13 A 
respectively. Hence many Lyman series absorptions were lo- 
cated in the UVES and STIS spectra. Locating these lines 
first was essential so that higher order Lyman lines could be 
identified and removed. Any remaining lines could then only 
be Ly-a absorptions (or metals). 

We used RDGEN to locate absorption lines. Using the 
velocity FWHM of the instruments; that were 10.0 km s _1 , 
6.7 km s" 1 and 230 km s' 1 for the STIS E230M, UVES and 
FOS G190H gratings respectively. The output from this was 
a table that included the wavelength, the observed equiva- 
lent width (EW), the error associated with these parameters 
and the significance of the absorption line. 

The significance level of each line was defined as, given 
error in the EW ctew- 

SL=™ (3) 

0"EW 

At this stage all the lines within the STIS spectra where RD- 
GEN had estimated an EW significance a ^ 3 were passed 
forward to the line-fitting programme VPFIT. No contribu- 
tion to the error in the EW caused by the error in the level 
of the continuum was included at this point. 

Absorption lines were fitted with Voigt profiles us- 
ing VPFIT (version 9.3). Using the instrumental velocity 
FWHM VPFIT would attempt to model the absorption line. 
This was achieved by a \ 2 minimisation when comparing the 
data and fitted Voigt profiles. The parameters of the lines 
were then returned together with la errors. The input to the 
programme was a table of the absorbers detected using RD- 
GEN that included the ion species and an approximate ini- 
tial guess for the redshift, column density and b-paramet er. 
The atomic data required was taken from lMortonl (|2003l 'l. 

The following steps were made when identifying and 
then modelling the lines. First the galactic lines such as Mg n 



and Fell were identified and flagged as any redshifted line 
that was blended with these had to be ignored. 

The second step was to identify redshifted metal lines 
above the Ly-a emission line. No redshifted metal lines 
were detected at any wavelength in the STIS spectra. 
Those detected in UVES were associated with a damped 
(logic, (Nhi (cm -2 ))^ 20) system. The HE 1122-1648 FOS 
G190H data exhibited a damped system at z = 0.681 
(logic, (Nhi (cm _2 )) = 20.5 ± 0.03). Civ, Mgn and Fell ab- 
sorption lines located in the UVES spectrum are associated 
with this. 

The other method employed to identify metal lines such 
as C IV and Mg 11 was by their doublet wavelength separa- 
tion. 

Metal lines were found within the Lyman-series forest, 
in particular the doublets C IV (AA1548.20, 1550.77) and O VI 
(AA1031.93, 1037.62). As this was done by eye only two ways 
were known to discriminate between metals and possible 
Ly-a lines. First if both lines could be spotted there was the 
doublet spacing. In addition, for metal lines the b-parameter 
is usually ^ 10 km s _1 while for L y-q b it is seldom ^1 5 
km s -1 and cannot be < 10 km s _1 (jjanknecht et al] [2006). 
So an inspection of the FWHM gave a good indication as to 
the nature of the species. 

With the metal absorption lines identified it could now 
be assumed that those left were Lyman lines. First of all, any 
remaining unidentified lines within the Ly-a forest were as- 
sumed to be HI. RDGEN was then used to plot the output of 
VPFIT and the location of the Ly-/3 lines were highlighted. 
VPFIT was then rerun if there was a positive match so that 
both lines could be used to constrain the errors. Any line re- 
maining unidentified within the Ly-/3 forest could only then 
be Ly-a. This process was repeated towards shorter wave- 
lengths until the blue end of the STIS echelle spectrum was 
reached. In the case of PKS 1127-145 only the Ly-a forest 
was visible. The damped system at z = 0.681 in the HE 
1122-1648 spectrum prevented any further lines being dis- 
covered at shorter wavelengths. 



3.3.1 The minimum detectable column density 

An effective way to decide the minimum detectable EW 
for the Ly-a lines was to invert the spectrum around the 
continuum, and using the b value of the smallest real Ly- 
a absorber we attempted to fit an 'absorption line' to 
the inverted spectrum. A line that we knew was a sam- 
ple of the noise. When the column density and lcr error 
to the largest noise profile became indistinguishable from 
that of the smallest known real absorber this gave the min- 
imum reliable value. For the PKS1127-145 and HE 1122- 
1648 STIS spectra the minimum detectable column densities 
were logic (N m (cm" 2 ))= 13.2 and logic (N m (cm -2 )) = 
13.5 respectively. For the UVES HE 1122-1648 spectrum 
where the S/N was ~ 120 p ixel" 1 at the central wave- 
length (|DairAglio et al.1l2008l ) this minimum decreased to 
logio (Nm (cm" 2 ))= 11.7. 

VPFIT would sometimes attempt to fit these weak lines 
with unphysical b-parameters. As the Doppler parameter is 

oc any Ly-a line that was fitted with b ^ 10 km s _1 

was dropped from th e list, where we have ad opted the same 
cut that was used in Ijanknecht et all (|2006T ). 
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3.4 Extracting lines from data with very poor 
signal-to- noise 

3.4-1 Fixing the variables 

We set VPFIT to terminate if the la error in the column 
density was greater than Alogio (Nhi (cm" ~ 2 ))= 0.5. When- 
ever this point was reached we would fix the b-parameter to 
the median b value of those lines that had been measured. 
This was 26.5 km s _1 . This was done for all of the Ly-a lines 
below 2300 A regardless of Alog(N H i) in the HE 1122-1648 
STIS spectrum where the S/N was very poor. 



3.4.2 The blue HE 1122-1648 STIS E230M spectrum 

The S /N of this spectrum was below 2 for most wavelengths. 
The wavelength range was meant to cover 1574 -» 2382 A 
but the data became unusable below the damped system at 
2050 A . 

In an effort to recover the absorption features in this low 
S /N data, we summed the flux over a narrow bandpass of 24 
pixels and then plotted this as a function of wavelength. The 
wavelength of the absorption lines that were lost unbinned 
data could then be determined. Knowing where these lines 
were meant that Voigt profiles of a fixed b-parameter could 
then be fit. 

The spectrum was cropped to remove those wavelengths 
below the DLA Ly-a line. The flux absorbed (di) per pixel 
i was then summed over a window of size M+l using the 
following calculation. Ci is the continuum level normalised 
to 1 for all i and Fi is the flux. 



*= E (Ci+j-Fi +j ) x (Ai +j+ i - A i+ j) (4) 



The variance of this result was also computed where the 
values were summed in quadrature. 



\ 



(Ai+j+i - Ai+j)] 2 



(5) 



The summed absorbed flux as a function of wavelength 
is plotted in Figure [5^,. 

Two identifications were made in this spectrum. These 
were two Ly-/3 lines associated with the systems at z = 1.229 
and z = 1.234, these can be seen in Figure [6^. at ~ 2290 A. 
The smallest profile had a width of 24 pixels, so assuming 
this profile which had an EW significance a = 8.7, was typ- 
ical of the minimum width we could hope to identify, this 
was chosen to be the window size M. 

The next variable to find was the minimum detectable 
EW for a line. For this we adapted the earlier method of 
inverting the spectrum to sample the noise. Equation @ 
was applied to the inverted spectrum using the same window 
size, this is plotted in Figure[6p. The peak inverted summed 
'flux' had a value of 0.54. Using this cut on the real summed 
spectrum meant only 28 absorbers would be included. So by 
using the peak noise value there was a danger that many 
real Ly-a absorbers would be lost. 

The red HE 1122-1648 STIS E230M spectrum had a 



the line density distri bution is almost flat at redshifts z ^ 1 
l|Kirkman et al.l 120071 ), so a similar value is expected. The 
conservative cut that left 28 absorbers implied a line density 
of M = 165. This cut-off needed to be reduced by 20% to 
a minimum flux-deficit of 0.43 before a similar line density 
could be observed. Figure [5p shows such a minimum cut- 
off would allow a lot of false identifications from smoothed 
noise. 

Therefore the cut was varied between 0.43 and 0.54 in 
order to see to what proportion of lines would be excluded 
when compared to the density of Ly-a lines that are ex- 
pected. Eventually the cut-off in Figure[5p was set to exclude 
all but the highest noise peaks at 0.49. In doing this 46 ab- 
sorbers were identified. A programme was then written that 
measured the wavelength of these peaks. Those that were 
higher order HI lines or had an EW significance less than 
3 were removed. This left 31 Ly-a absorbers that could be 
measured using VPFIT. 

The original flux and variance spectra spectra were then 
smoothed using a box-car window of 11 pixels and this was 
used in the VPFIT programme. VPFIT was now successful 
in modelling the Voigt profiles with la errors in the column 
density Alogio (Nhi (cm _2 ))< 0.5 and these absorbers were 
added to the final line-list. All of the lines found using this 
method had a b-parameter fixed to the median value found 
for those lines that had been fitted successfully at higher 
wavelengths of 26.5 km s _1 . 



3.5 Variance Weighted Equivalent Width 
Significance and the Continuum error 

All Ly-a lines with an equivalent width significance SL ^ 3 
were included in the cross-correlation f unction, this is the 
level adopted in ISchneider et~aU (|l993h - However we had 
to fix the b-parameter for 42 out of the 135 Ly-a lines in 
the STIS spectra that had an equivalent width significance 
of 3, and assume the continuum was correct. Therefore the 
EW and significance of the lines were measured again. This 
time a Gaussian weighting was used where changes in the 
significance of a line because of errors in the continuum were 
also considered. The formulae used to calculate the EW an d 
significance have been adapted from Schneider et al. I (|l993h . 

Consider an absorption line that covers J pixels and is 
centred on pixel i with the continuum C set to 1. The EW 
of the profile is equal to: 



EW : 



+J-J/2 



Fi 



+J-J/2J 



(6) 



AAi was assumed to be constant across the absorp- 
tion, and r/j is the Gaussian weighting coefficient where 
Eti^j = L The FWHM of the Gaussian profile for each 
absorber was based on its b-parameter. 

The amount that the variance spectrum (a 2 ) con- 
tributed to the error in the EW was computed using: 



<5(EW, 



(7) 



Ly-a line density of 



198 and all results indicate that 



The significance of each line was then, like equation Q, the 
ratio of equations ([6]) and (0. 

Our approach when testing the effect of changes in the 



The Association Between Gas and Galaxies 11 



a) Summed flux absorbed as a function of wavelength 



Variance 

Final significant absorber cut-off 
Minimum absorber cut — off 



Ly/3 




2050 



2100 



2150 



2200 
Wavelength-A 



??5c 



2300 



2350 



nverted summed "flux" absorbed as a function of wavelength 



0.5 




-- Variance 
- - Final significant absorber cut — off 
Minimum absorber cut — off 



2100 



2150 2200 225C 

Wavelength-A 
Box-car size = 24 pixels in both plots 



2350 



Figure 6. (a) In order to locate Ly-a absorbers in the blue STIS E230M spectrum of HE 1122-1648 the absorbed flux was summed over a 
window of 24 pixels using equation ((4J. This way any absorbers were enhanced and could be modelled using VPFIT. (b) The 'significant 
absorber' cut-off lines were set by inverting about the continuum the spectrum and sampling the noise. To recover the expected line- 
density at this redshift would require a cut in the flux deficit of 0.43, however many noise features would then be included (dashed-dot 
line). Thus a more conservative cut of 0.49 was chosen. After a final EW significance level cut SL ^ 3 31 Ly-a lines were identified. 



level of the continuum was to estimate the significance levels 
when an error S (Cj — Fj) of 10% was added. VPFIT was not 
rerun to find new values for the b-parameter or the column 
density. 

(5Ew jcont ) 2 = ( a^p.) ) 2 ^^-^)) 2 ( 8a ) 

.-. (5EWcont) 2 = W-^O xO.01 (8b) 

Errors in the EW, that included contributions from 
both the continuum and variance, were calculated using 
equations Q and (|8b[) summed in quadrature. Those ab- 
sorbers that had a new variance weighted equivalent width 
significance a ^ 3 were used in the cross-correlation. 

It was found that even with a 10% error margin in the 
continuum, the Gaussian weighted significance of all the 
STIS absorption lines did not change by more than 1% 
and no additional lines were lost because of the 3<r cut. 
This was because of the relatively high minimum observ- 
able optical depth produced by a Ly-a column density of 
logio (Nhi (cm- 2 ))= 13.2. 

Figure [7] shows the EW of the 135 lines detected in the 
STIS spectra that had a significance level 3. 

One can observe that the peak in the redshift distribu- 
tion of galaxies in Figure[3]is at redshifts z ~ 0.8. Thus many 
of the pairs in the correlation according to Figure [7] were 



made with those absorbers from the spectrum with poor 
S/N. These are absorbers with a higher minimum EW limit 
hence a lower completeness. Pairs from this region made up 
72 of the 171 detected in the cross-correlation plot in Figure 
112b .. The effect of this increase in the minimum EW means 
we may over-estimate the cross-correlation. It is absorbers 
with a small EW that also have a low column density, and 
we do not expect any corr elation with absorbers t hat have 
logio (Nhi (cm -2 ))^ 13.5 (|Chen fc Mulchaev||2009l ). 



3.6 The Final line-lists 

The final line-list for HE 1122-1648 contained 740 absorp- 
tion lines that were detected in the STIS E230M and 
UVES echelle data that had an equivalent width significance 
greater than 3. This included 586 Ly-a lines each with a b- 
parameter ^ 10 km s _1 and 154 metals. Of these Ly-a lines 
97 came from the two STIS E230M spectra. 

Fewer Ly-a lines were detected in the STIS spectrum 
of PKS 1127-145 due to the lower quasar redshift. 38 Ly-a 
lines were found together with 23 metal lines in both the 
STIS and UVES spectra. Therefore over a total path length 
of Az = 1.08 we detected 135 Ly-a absorbers that could be 
used in the correlation function. 

The minimum column density for the Ly-a absorbers 
used in the correlation function was logio (Nhi (cm _2 )) = 
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Figure 7. The equivalent width of those Ly-a absorbers with a 
significance a > 3 as a function of redshift. The increasing mini- 
mum detectable EW of those absorbers from opposite ends of the 
HE 1122-1648 STIS spectrum can be observed. Those absorbers 
below the vertical line at z = 0.86 were detected in the low S/N 
blue STIS spectrum. 



constant was calculated for each field before the pairs were 
binned together. 

To compare the values of £ag against a known observ- 
able the galaxy auto-correlation function £gg was calculated 
using the same techniques as the cross-correlation. 



4.1 Measuring separations 

The line of sight co- moving distance (Dc), to both galaxies 
and absorbers that are at a redshift z, was calculated using 
equation (110[) . 



c 

Ho 



dz' 



{ y^Mtl+z'f+fiA 



(10) 



The line of sight separation, it, was then it — |Dc galaxy — 

Dc absorber | ■ 

The angular separation between 2 points on an image 
that have been projected from the celestial sphere is given 
by equation where 8 and a are the declination and R.A 
of the targets in radians. 



13.2. The maximum column density detected was 
logio (Nhi (cm -2 ))= 17.4, however only 11 absorbers out 
of those correlated had a column density greater than 
logio (Nhi (cm _2 ))= 15. The median column density was 
logio (Nhi (cm -2 )J= 14.0. The complete line-lists for both 
HE 1122-1648 and PKS 1127-145 are available in electronic 
format. 



D z = arccos (sin 5 abs sin <5 ga i + cos <5 abs cos <5 ga i cos (a ga i - a a bs)) 

(11) 

The RA and DEC of each absorber were the J2000 co- 
ordinates of the quasar in question retrieved from the SIM- 
BAD database. Using the mean line of sight distance to- 
wards both the absorber and galaxy, the impact parameter 
was then calculated to be a — D^Dc- As long as the n sep- 
aration was ^ 20 h^Q 1 Mpc, each real galaxy-absorber pair 
was then written out to a file for subsequent binning. 



4 THE 2-POINT CROSS-CORRELATION 
FUNCTION 

When investigating the relationship between gas and galax- 
ies the calculation for the 2D 2-point correlation was based 
on the estimator devised by iDavis fc Peebles! (|l983h . The 
2D 2-point correlation function (£ag (f,7r)) in equation (J2J) 
gives the measure of association between the Ly-a absorbers 
and galaxies across both the projected (a) and line of sight 
(n) separations by comparing the real distribution of pairs 
with random catalogues. 

?ag(<t,7t) = Nq S o^-1 (9) 

Along each line of sight every absorber was paired with 
every galaxy and the impact parameter and line of sight 
separation in co-moving coordinates was calculated for each 
pair. Each random absorber was also paired with every ran- 
dom galaxy and the separation found between these pairs. 
DD and RR are the number of real-real and random-random 
pairs respectively in each bin along the n and a separations. 

The normalisation constant Nqso fixed the total num- 
ber of random pairs to the same value as the number of real 
pairs across all a and 7t. This way £ag (o", 71 ") gave a value 
for the fractional excess of pairs compared to a random dis- 
tribution. Due to the different number of real pairs that 
occur along each quasar sightline a separate normalisation 



4.2 Generating the random catalogues 

4- 2.1 The random galaxies 

When producing the random catalogues it was important 
that the random galaxies have the same position on the sky 
as the data in order to keep the mask design information. 
Therefore adopting the technique used in W07, each real 
galaxy that lay in the redshift range of the STIS E230M 
spectra of that particular line-of-sight was replaced with 200 
fake galaxies in that position on the sky. 

In order not to bias the 2D correlation function it was 
also important that the redshift distribution of the random 
galaxies be the same as the real sample. To achieve this the 
redshift of each randomly generated galaxy was a perturba- 
tion of 8z = ±0.01 — > 0.05 from the galaxy that spawned 
it. This range was chosen so that no random pair generated 
from the same real pair would appear within the plotting 
space of Atv — 20.0, Act = 5.0 h^ 1 Mpc. For example there 
are 37 galaxies in the field of HE 1122-1648 between red- 
shifts z = 0.68 -> 1.51, hence there were 37 x 200 = 7400 
random galaxies. 

4-2.2 The random absorbers 

The random absorbers were generated in the same fashion. 
Each real absorber was replaced with 200 randoms, each 
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with the same column density, b-parameter and EW as the 
original. 

So that the random absorbers would have the same 
line-density as the real sample, the redshift of each ran- 
dom absorber was also a perturbation from the original. 
Sz — ±0.06 — > 0.1. The separation between all random ab- 
sorbers and random galaxies was then collected and likewise 
binned if An < 20.0 h^ 1 Mpc. 

Using 200 random targets per real absorber or galaxy 
was decided on a trial basis. This value was a compromise 
between limited computing power and ensuring that the RR 
pair distributions were sufficiently smooth, with little vari- 
ation in the number of pairs along the line of sight at sepa- 
rations of equal impact parameter. 



4.3 Binning for the 2D plot 

We selected pairs separated by less than or equal to 20 and 
5 co-moving h^ 1 Mpc along the n and a directions respec- 
tively and binned them on a grid. Each value in this array 
was then divided by the equivalent bin in the RR array 
and 1 subtracted to give £ag (c ,?!")■ O ur choice of binning 
was Act = 1.0 hfo 1 Mpc, and Ait = 2.0 hf,, 1 Mpc. This was 
coarser than the earlier studies of RW06 and W07, who 
binned Ait = Act = 0.1 h wo Mpc and Air = 2.0 Acr = 
0.4 h? Mpc. We required this coarser sampling because of 
the small number of real pairs. 



4.4 The error calculation 

In the previous studies of RW06 and W07, the uncertainty 
in £ was calculated by jackknife resampling of all the sight- 
lines. The correlation function was recalculated removing 
one quasar at a time and an error determined for each bin. 

Having only two quasars in this study an alternative 
method was chosen where mock galaxy tables and linelists 
were created with the same number of absorbers and galax- 
ies and redshift distribution as the real data. The genera- 
tion of mock redshifts was identical to that used to gen- 
erate the random catalogues described above. These were 
then treated as real data sets and run using the exact same 
code. 50 of these mock linelists and galaxy tables were pro- 
duced together with 50 estimates for the correlation function 
made by removingone of the mock absorber and galaxy cat- 
alogues at a timelj A RMS error based on the sample vari- 
ance was then calculated using equation (|12[) . With N = 50 
mock datasets, 



"SO,*-) = \ ~ 



N 



(12) 



The Poisson error was found to overestimate the sig- 
nificance of the correlation result (W07), and so was not 
calculated. 



5 To avoid confusion 'mock' catalogues are ones treated as real 
data in order to estimate an error and all have an independent 
value for DD. Random catalogues, be they from the real or mock 
data, are used in the denominator of the estimator. 



Figure 8. The curve in this diagram shows the increase in the 
maximum possible observable impact parameter when using the 
6.8' X 5.7' detector on the FORS2 instrument. This plot was de- 
rived using our stated cosmology and assuming that the galaxy 
was located in the corner of the detector, the angular separation 
59 2 = 3.4' 2 + 2.85' 2 from the QSO at the centre of the chip. The 
constraints of the size of the field and the maximum galaxy red- 
shift meant we were able to correlate galaxies with absorbers out 
to a projected distance of a = 5 h^Q Mpc. 



5 RESULTS 

5.1 The Galaxy Impact Parameter Distribution 

The maximum possible impact parameter between Ly-a ab- 
sorber and galaxy pairs is a function of redshift and was 
constrained by the 6.8' x 5.7' field of view of the FORS2 
detector. Figure [8] shows a plot of the maximum possible 
impact parameter. The solid line in Figure [9] is a histogram 
of the projected separation between all 193 galaxies in the 
survey and the central quasar sightline. 

The initial shape is caused by the increase with redshift 
in the number of galaxies located within each concentric 
cylinder centred on the quasar. 

This distribution then peaked along a plateau at a pro- 
jected separation of a = 800 — 2000 h^ 1 kpc before rapidly 
declining above a — 3000 h^ 1 kpc. This flattening of the dis- 
tribution reflects the peaks in the histogram of Figure [3] and 
the expected range in impact parameter at these redshifts 
(see Figure [Sj . 

Figure [3] shows most of the galaxies were detected at a 
redshift z ~ 0.8. Therefore the peak in Figure H] must occur 
below a ~ 3000 h^ 1 kpc. The number of galaxies per bin 
had already started to decline by a — 2000 h^, 1 kpc. This 
was caused by a decrease in the radial density of slits as the 
edge of the rectangular 6.8' x 5.7' detector was reached. 

The number of galaxies above a = 4000 h^ 1 kpc was 
low because the size of the chip meant that a minimum 
redshift z > 1 was required to reach this impact parameter. 

The dashed histogram is a completeness test that shows 
the expected separation had every galaxy within the same 
magnitude range (21.5 ^m_H ^ 24.5) been targeted. Mock 
galaxies were placed with a uniform spacing over the same 
6.8' x 5.7' area. The required surf ace density f or thi s mag- 
nitude cut came from Figure 4 of iHogg et all (|l997f l. Each 
mock galaxy was then assigned a random redshift taken from 
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Figure 9. The solid line histogram shows the number of galax- 
ies surveyed as a function of impact parameter binned by 200 
h^, 1 kpc. The number of galaxies observed has an expected in- 
crease, peaking at 800-2000 h^ 1 kpc as the survey volume in- 
creases. The edge effects of the chip and the drop off in the num- 
ber of galaxies at high redshift then cause a decrease in the num- 
ber of galaxies beyond 3000 h^, kpc. The dotted line represents 
the expected number of galaxies in a complete magn itude limited 
survey (21.5 ^ R ^ 24.5 ) where the data is from ICohen et al.l 
(|200d) ; lHo"gg et alj lll997h . This plot shows that even though we 
were able to sample the correct impact distribution for the galax- 
ies the survey was far from complete. 
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Figure 10. This histogram binned by 0.1 in redshift shows the 
distribution of all Ly-c* absorbers with significance a ^ 3 detected 
across both lines of sight and both the STIS and UVES instru- 
ments. The dashed-dot line marks the redshift cut-off between the 
two instruments and Ly-« absorbers above this were only found 
along the sightline of HE 1122-1648. Ly-a absorbers below the 
long dashed line were found in the low wavelength HE 1122-1648 
dataset. a damped Lyman system prevented any more absorbers 
being detected with a redshift below z = 0.68. No echelle STIS 
data below a redshift of z = 0.896 existed for QSO PKS 1127- 
145. The asterisks mark the redshift of quasars PKS 1127-145 at 
z= 1.187 and HE 1122-1648 at z= 2.4. 



the Hubble De ep Field North Sur vey redshift distribution 
from Table 4 of lCohen et~ai1 l|200d ). A fiducial redshift cut- 
off for the mock galaxies was chosen to be 1.8. We then cal- 
culated the impact parameter of each mock galaxy from an 
imaginary sight line located at the centre of the CCD. This 
was repeated using 1000 mock galaxy catalogues in order to 
calculate the expected impact parameter distribution. 

Figure [5] shows that our galaxy survey follows the same 
general shape as the expected distribution. The mock cat- 
alogue also peaks in the range a ~ 1000 - 2000 hfo 1 kpc 
before decreasing above a = 2000 h"^ 1 kpc because of edge 
effects. However the figure also shows that our survey was 
far from complete. 

5.2 The Ly-a Absorber Redshift distribution 

Figure [10] shows a histogram of all the Ly-a absorption lines 
detected with an equivalent width (EW) significance a ^ 3 
from the STIS and UVES echelle spectra across both lines- 
of- sight. 

All of the Ly-a absorbers below z = 1.51 were found in 
the STIS data and are the 135 lines that were used in the 
cross-correlation function. Those above are from the UVES 
spectrum and hence are all from the HE 1122-1648 sightline. 

Those below z = 0.86 were found in the blue STIS data 
set that covered the wavelength range 2050 — 2382 A and 
so are also only from HE 1122-1648. The asterisks mark 
the redshifts of quasars PKS 1127-145 and HE 1122-1648 at 
z = 1.187 and z = 2.4 respectively. 

Three factors affect the shape of this histogram. The 
line-density distribution only evolve s slowly out to redshift s 
z ~ 1.6 and then increases rapidly jjanknecht et al.1 l2006). 



Our data agrees with this result within the error margin. Our 
data is from only two sightlines, one of which terminates at 
z = 1.187, but also the deficit of lines at z ~ 1.5 is attributed 
to the decreasing S/N and low sensitivity at the extreme red 
end of the STIS spectra of HE 1122-16478 (see Figure Q. 

The increase in the line density above z = 1.5 was also 
due to the increased sensitivity and S/N of the UVES in- 
strument. 

5.3 Combining the Ly-a Absorber and Galaxy 
datasets 

Figure [11] shows the location of both galaxies and absorbers 
across each sight line. 

The maximum impact parameter (~ 5000 kpc) shows 
the distance out to which the correlation function was calcu- 
lated, and also explains why so few pairs were found in the 
highest separation bin. Only those galaxies with a redshift 
z :> 1 could achieve this separation. 

Pairs that were used for the correlation function had a 
maximum line of sight separation of 20 co-moving h - , 1 Mpc. 
At a redshift of z ~ 1 this equates to Az ~ 0.05. There- 
fore each absorber in the Pie diagrams was only paired with 
galaxies within a very narrow redshift window, the size of 
which is too small to resolve in Figure [TT1 

5.3.1 The Proximity Effect 

The line-of-sight proximity effect (|Baitlik et al.1 fl988) de- 
scribes the process where ionising radiation from a quasar 
lowers the neutral fraction of HI, thereby decreasing the 
line-density of Ly-a absorbers. For lines close to the quasar 
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Figure 11. The Pie-diagram for quasar fields HE 1122-1648 and PKS 1127-145. The top panels, plots (a) and (d) show the impact 
parameter between the galaxy and the quasar sightline plotted as a function of redshift. Panels (b),(e), (c) and (f) show how this separation 
is split between the RA and DEC of each quasar field respectively. The increasing maximum separation with redshift, observed in plots 
(a) and (d) mark the boundary of the survey volume. The maximum redshift plotted only covers the overlap between the two populations, 
so only absorbers from the STIS E230M spectra have been included. The narrower Ly-c* absorber redshift window of the PKS 1127-145 
field is because of the lower redshift of this quasar. 



redshift we may also detect gas that is associated with the 
quasar itself. This is a concern for the PKS 1127-145 field 
since galaxies are located near to the QSO redshift. For this 
reason the 3 L y-q absorbers and 3 galax ies that were within 
3000 km s _1 jChen fc Mulchaevl l2009h of the quasar were 
excluded from the line-list and galaxy catalogue when cor- 
relating pairs. 



Binned to this level we can conclude the following. 

• For low column density absorbers (logio (Nhi (cm -2 ) ) ^ 
17.4 of which only 11 have logio (Nhi (cm~ 2 ))^ 15), there is 
no evidence for correlation between the Ly-a absorbers and 
galaxies. The maximum value in Figure 112( c) was £ag = 



2.2 ± 0.9 at 7T 



6. 



— 1 h 70 Mpc. So within 



20 x 5 hy Mpc a 3cr upper-limit for the cross-correlation is 
Cag = 2.8. 

The bin with the peak in the correlation function con- 
tained 4 real pairs (1.25 random). Excluding the a — 4 — 5 
h^g 1 Mpc separation 3 bins contained no pairs giving £ag = 
— 1.0. One of these bins occurred at7r = 6 — 8, a — — 1 
h^, 1 Mpc with the minimum value of £ag = —1.0 ± 0.8. So 
hfg 1 Mpc. These pairs were made of Ly-a absorbers that had there is a 2.6a variation in £ag between the maximum and 
an equivalent width significance of at least 3 and galaxies of minimum values along the line of sight at a = — 1 hf Mpc. 
a redshift confidence < 4 according to the flags in Table [3] A similar variation in £ag is also shown at a = 1 — 2 hf Mpc 

across the same range in values. The values of £ag varied 
from £ag = — 0.4±0.5 to £ag = 1.9±0.6. Comparable values 
of £ag are also found at a = 2 — 3 h^ 1 Mpc, with a peak of 
£ag = 0.7 ± 0.5. So there are no significant trends observed 
in the degree of correlation along either axis. 

• Binned to this level and at these redshifts obviously 
we see no evidence for a "finger-of-god" at small projected 
separation along the line-of-sight. 



5.4 Results for the 2D 2-point Correlation 
Function 

Figure [T2T a1 shows the 145 real Ly-a absorber-galaxy pairs 
out to 20 x 5 hfo Mpc binned with Ait = 2.0, Act = 1.0 



Above a 4 hy Mpc there were very few real pairs 
detected. For this reason this last column should be ignored 
in subsequent plots as the Poisson noise introduced large 
fluctuations in the results for £ag in these bins. Where this 
occurred the highest value in the colour bar of all of the 
plots ignores these maxima. Figure [T2T b) shows the plot of 
the normalised RR counts. The DD/RR-1 correlation plot 
is in Figure [HJc) and the associated error in Figure [T2T d). 
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a - Projected distance (Mpc) a - Projected distance (Mpc) o - Projected distance (Mpc) o - Projected distance (Mpc) 



Figure 12. Plot (a) shows the real absorber-galaxy pairs out to 20 X 5 h^ Mpc, binned Sir = 2.0 5a = 1.0 h^Q 1 Mpc. Those galaxies with 
a maximum confidence level of 4 have been included in this plot, with all Ly-ct absorbers with an equivalent width significance oew ^ 3. 
The 2D 2-point correlation function £ag = DD/RR — 1 was calculated using 145 pairs. Plot (b) is of the random pairs normalised to 
the same number of pairs as the real pair data. Plots (c) and (d) show £ag an< i the associated error. There is no evidence for correlation 
between galaxies and Ly-« absorbers within these distances at this level of binning. A 3cr upper-limit of £ag = 2-8 was computed using 
the bin with the maximum value at 7r = 4 — 6, cr = — 1 h^Q 1 Mpc. The central bin has a value of £ag = 0-6 ± 0.9. 



• A result that was shared with Figure 2 of W07 is the 
suggestion of an off-origin maximum in the correlation for 
low density absorbers (logro (Nhi (cm -2 ))= 13 — 15), al- 
though this result was also consistent with a correlation 
equal to 0. The correlation function at the bin of closest 
separation in Figure I12f c') is £ = 0.6 ± 0.9. This is only 
1.2cr lower than the peak along a = — 1 h^ 1 Mpc. That 
both investigations hint at a drop in correlation at small 
separation is only tentative evidence that low-column den- 
sity Ly-a absorbers are not observed at short distances. As 
their column density is so low these absorbers could be eas- 
ily photo-ionised and stripped of their neutral gas when they 
enter the halo. 

We expect lines of a higher column density to show a greater 
clustering. However, with such a small line-list and so few 
absorbers that had logro (Nhi (cm~ 2 ))^ 15 we have not 
measured the cross-correlation of a sub-set with absorbers 
that have a higher column density. Only 18 pairs fit this 
criteria with 2 real pairs in the bin of smallest separation. 



5.5 Comparison with the Galaxy 
Auto-Correlation Function 

To compare with the cross-correlation, the galaxy auto- 
correlation was calculated for all the galaxies with a red- 
shift confidence ^ 4 that were above a redshift of z = 0.68. 
All literature results point to the galaxy auto-correlation 
peaking at the small est separation then decaying in a dis- 
torted radial pa t tern fPollo et al1l2005l; Hawkins et al.|[2003l ; 
ICoil et al.ll2006l ; iGuzzo et alj|200g| ; IZheng et al.ll2007ft . 

In the galaxy auto-correlation function the distribu- 
tion of the random pairs was made by perturbing each real 
galaxy redshift by a random amount that varied between 
8z = ± 0.01 — 0.1. These boundaries were chosen so that 
the large-scale redshift distribution was maintained but any 



galaxy pair that had existed would now no longer appear 
within the 20 x 5 h^ 1 Mpc window. 

The plots in Figure [TjI show the result of this approach. 
Plot (a) of Figure [13] are the 138 DD galaxy pairs. Figure 
[13] plot (b) is of the RR pairs. Plots (c) and (d) show £gg 
and the error, with a central peak of £gg — 10.7 ± 1.4. 

The value in the central bin of £gg is in agreement with 
other galaxy auto-correlation measurements taken at the 
same epoch, despite the smaller number of pairs in our sam- 
ple. F or example from the VVDS-Wide survey (|Pollo et al.l 
l2005f ) which contained ~ 6000 galaxy redshifts at 0.6 ^ z ^ 
1.2. The correlation function contained a peak of £gg > 10 
in the centra l bin and a promin ent "finger-of-god" along the 
line of sight (|Guzzo et al.ll200ct ). 

It is interesting to study the evolution of the galaxy 
auto-correlation in order to compare this with changes 
in the Ly-cv absorber-galaxy cross-correlation. The auto- 
correlation at small separation and at redshifts z ~ 1 is 
lower than at more recent epochs because of the gr avita- 
tional collapse of the halo systems (|Zheng et al. 1 120071 ). For 
example Figure Q3] shows the auto-correlation of ~ 220000 
alaxies at redshifts 0.01 ^ z ^ 0.2 from the 2dFGRS data 



Hawkins et~aT1 l2003h . These have has been re-binned to 
match our plots so that the values for £gg can be com- 
pared. At redshits z ^ 0.2 the value in the central bin is 
£gg = 17.4 ± 2.9 and there is a significant "finger-of-god" 
along the line of sight. 

An alter native way to convey t he difference between the 
results from lHawkins et~al1 l|2003h and our galaxy survey 
is to plot the redshift-space correlation parameter £gg (s), 



6 Data supplied by Dr. P. Norberg, Institute for Astronomy, Uni- 
versity of Edinburgh. 
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Figure 13. Plots (a) and (b) show the DD and normalised RR pairs that were used for the galaxy auto-correlation function. The 
redshifts for the RR pairs were created by perturbing each real redshift by <5z = ± 0.01 — 0.1. Plots (c) and (d) show §gg ( a ^ n ) anc l the 
associated error. The peak in the auto-correlation in the bin of smallest separation is £gg = 10.7 ± 1.4. 
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Fig ure 14. The 2D 2-poin t auto-correlation £ G G of the 2dFGRS 
from lHawkins et al.l ll2003h . The distances have been set to h=0.7 
and the data re-binned to 8n = 2.0, 8<r = 1.0 h^, 1 Mpc. When 
compared to figure 1131 the values for £gg are 2.1 <r higher at the 
centre with a value of £gg = 17.4 ± 2.9. We also see a significant 
"finger-of-god" redshift distortion along a = — 1 h^g 1 Mpc. An 
increase in £gg is expected at lower redshifts due to gravitational 
collapse. 



where, 



s = \/(7r 2 + a 2 ) 
This is shown in Figure [T5l 

5.5.1 Comparing the cross and auto- correlation function 

When £ag from Figure WM c] is compared with £gg from 
Figure ri3t c) we can draw the following conclusion. The val- 
ues of the central bin have a 5.9a difference in the level of 



F0RS2 Survey - z ~1 
2dFGRS - z<0.2 



3 Mpc 



Figur e 15. This com parison between the auto-correlation from 
I Hawkins et al. and our result shows the 1.6a increase in 

the auto-correlation £gg ( s ) °f t ne central bin as the redshift has 
decreased from z ~ 1 to ^ 0.2. 



correlation, with a 5Aa difference between the central peak 
of £gg and the 3rr upper-limit of £ag- 

Therefore there is significant evidence of a difference in 
the cross-correlation between galaxies and Ly-a absorbers 
and the galaxy auto-correlation on scales of 1 Mpc at red- 
shifts ~ 1. 

To check that our results are not dependent on galax- 
ies with less certain redshifts, the cross and auto-correlation 
function were recalculated if pairs containing galaxies that 
had a redshift confidence of 4 were excluded. Galaxies that 
were identified only using [O II] emission constituted the ma- 
jority of the 95 galaxies of those that had a redshift greater 
than z = 0.68. This meant that there were too few pairs for 
both the cross and auto-correlation in each bin. In order to 
use the much smaller data sample we needed to increase the 
bin size. 
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5.6 Variation in the binning and the ID 
correlation function along n and ct 

5.6.1 Binned 2x2 h^ 1 Mpc 

To reduce the effects of shot noise for the smaller galaxy 
sample we increased the size of each bin. The bin size was 
increased to Aty = Act = 2 hfj, 1 Mpc. However, the error 
in the cross-correlation was still comparable with the data 
in most bins. These new plots only extend to an impact 
parameter of 4 hfQ 1 Mpc. 

Figures [16] and [17] show £ag and £gg respectively for 
those galaxies that had a redshift confidence 3. 

Figure \W\ supports the conclusion made earlier. The 
maximum value in the cross-correlation, not at the central 
bin, was now at tt — 12 — 14, ct = — 2 hf x Mpc with 
a value of 0.7 ± 0.6. Therefore when binned to this extent 
there was only a weak correlation between Ly-a absorbers 
and galaxies. We found a 3ct upper-limit of £ag = 1-9. 

Comparisons with the galaxy auto-correlation also re- 
mained the same. When binned to A-zr = Act = 2 h^ Mpc, 
(where averaging over 2 hyQ 1 Mpc along a caused £gg to de- 
crease), in the central bin there is still stronger clustering 
amongst galaxies than between galaxies and Ly-a absorbers 
at a 6. Oct level of confidence. The central bin in the auto- 
correlation now had a value of £gg= 7.4 ± 1.0. 

When the galaxies with a redshift confidence ^ 4 were 
used at this level of binning no significant difference was 
observed. The cross and auto-correlation had values in the 
central bin of £ag = 0.7 ± 0.5 and £gg = 6.1 ± 0.7 respec- 
tively. 



5.6.2 Summed over the line of sight and projected 
separation 

In order to further reduce the Poisson error in the correlation 
function we collapsed the pair counts along each dimension 
entirely. As in the previous section, in order to maximise the 
number of pairs, we include galaxies with a redshift confi- 
dence ^ 4. 

Plots (a), (b) and (c) of Figure [TH] show the number of 
real pairs, the number of random pairs and £ag when binned 
along the projected separation with Act = 0.5 hfg 1 Mpc. 
This is the projected correlation S (ct) where; 

3 (a-) = J £(cr,it) dn 
o 

In practice the upper limit was taken to be the maximum 
separation of the pairs when binned in the 2D plots, tt = 20 
h^Q Mpc. The absolute separation between the absorbers 
and galaxies was used, therefore the lower limit was zero. 

By summing over the line of sight we also removed any 
peculiar velocity distortions that may have been present 
since these only displace pairs along the tt axis. 

The general shape of plot (a) of Figurell8lsupports what 
has already been mentioned in Section 15.11 As the survey 
volume increased the number of galaxies, hence the number 
of pairs also increased. This peaked between 1-2 hfQ 1 Mpc 
before decreasing because galaxies were now located at the 
edge of the surveyed region. The smaller number of pairs at 
ct = 2 — 2.5 h^Q 1 Mpc is real and can be deduced from the 



Pie-diagrams in Figure [TT] In the field of view of HE 1 122- 
1648 there were only 4 galaxies that were at a separation of 
2 - 2.5 hfo 1 Mpc. In the field of PKS 1127-145 there were 
20, however only 4 of these were between the same redshift 
range as the absorbers. 

The offset peak in plot (c) of Figure [TH] at ct = 1 —5- 1.5 
h^Q Mpc could be tentative evidence that low column den- 
sity absorbers are more scarce the closer the distance to a 
galactic halo. However this peak is not significant with an 
increase of only 1.3ct from ct = — 0.5 h^ 1 Mpc. The level of 
correlation then remained the same, as there was no signif- 
icant change in Hag (cr) out to ct = 4 h^, 1 Mpc. The 2 bins 
above a — 4 h^, 1 Mpc have been ignored in this comparison 
as only 4 real pairs were found beyond this distance. 

When binned using Att — 2 h^ 1 Mpc along the line 
of sight, the weak correlation £ag (t) shown in plot (f) of 
Figure [18] did not have any significant change in value as tt 
increased from — 20 hfg 1 Mpc. 



6 DISCUSSION AND CONCLUSION 

The two straw-man models mentioned in the introduction 
were that the gas follows the galaxy distribution and collects 
in galaxy haloes, or it forms part of a diffuse network, located 
near but not necessarily causally linked with galaxies. 

Figure [12] showed that we did not detect significant 
correlation between galaxies and Ly-a absorbers that have 
13 ^logio (Nhi (cm _2 ))sC 17.4. As only 145 absorber-galaxy 
pairs existed within Act = — 5, An = — 20 h^ 1 Mpc, in 
Figure n"27 a). no cuts in the column density were made as 
this would further increase an already large uncertainty. An 
overwhelming majority of the Ly-a absorbers from the STIS 
spectra however had logio (Nhi (cm~ 2 ))^ 15. 

We calculated a 3ct upper-limit of £ag= 2.8. The de- 
crease in the 3ct upperlimit to 1.9 in Figure [TBI caused by 
the increase in binsize to Act = Air — 2 h^ 1 Mpc is further 
evidence that £ag is near to when averaged over these dis- 
tances. This result was supported by the ID plots of £ag(tt) 
and Hag (c) along the line of sight and projected separation 
in Figure \TE\ 

The value in the central bin for the galaxy auto- 
correlation function was £gg= 10.7 ± 1.4. This was a 5.9ct 
increase on the value in the equivalent bin of the absorber- 
galaxy cross-correlation that had a value of £ag= 0.6 ± 0.9. 

Even when the value for the auto-correlation was re- 
duced by the larger binning in Figure 1171 the difference be- 
tween the 3-ct upper-limit in the cross correlation and the 
central bin in the auto-correlation was still 5.5ct. 

This evidence implies that at a redshift of z ~ 1 galax- 
ies that inhabit the nodes and filaments of the dark matter 
cosmic web are not strongly correlated with the low col- 
umn density Ly-a absorbers (logio (Nhi (cm" ~ 2 ))s5 17) that 
loosely trace this filamentary structure. 

With a catalogue of 95 galaxies within a redshift range 
of z — 0.68 — 1.51 that produced 138 galaxy pairs, it is un- 
likely that we would have been able to reproduce the result 
and distributio n of pairs of the ga laxy auto-correlation func - 
tion found by IPoIIo et all (|2005l l and iGuzzo et all (|2008h . 
However our results were similar within the margin of error. 
When these studies are compared with the results at z ^ 0.2 
there is a noticeable decrease in the auto-correlation. 
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Figure 16. Plots a-d are of the 50 absorber-galaxy DD pairs, RR pairs, £ag ( cr ) 7r ) an d error respectively when the level of binning has 
been increased to Air = Aa = 2 h^Q 1 Mpc. The error per bin has decreased and the peak in correlation is now 0.7 ± 0.6 at 7r = 12 — 14, 
<t — — 2 h^Q 1 Mpc. The result is the same as to what was observed in Figure 1121 This negligible degree of correlation also does not 
change significantly along the line of sight as the the value varies by ~ lcr only. Values for £ag a l so do not significantly change when at 
a projected separation of a = 2 — 4 h^ Mpc. 
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Figure 17. Plots a-d are of the 42 real galaxy pairs, DD, random-random galaxy pairs, RR, and the galaxy auto-correlation, £gg an d 
error respectively. When binned Air = Aa = 2 h^" Mpc the same conclusions can be drawn for the auto-correlation function. Correlation 
between galaxies decreased along the line of sight, until after ir ~ 6 h^ 1 Mpc the correlation signal was comparable with £ag (""i 71 ")- 



When our results are compared with the values of the 
cross-correlation at redshifts <C 0.5 the data suggest that 
the cross-correlation between low column density Ly-a ab- 
sorbers and galaxies does not have a strong redshift de- 
pendence. This may be because of the flat evolution in 
the Ly-q absorber den sity distribution at redshifts < 1.7 
jjanknecht et all l2006h . 

This conclusion is a stark contrast to that which was 
found by RW06. In that investigation £ag had been found 
to be comparable with £gg with values w 11 in the closest 
bin. There are three possible reasons for this difference. First 
of all the dataset of RW06 was at low redshift, where cluster- 
ing is thought to have increased because of gravitational col- 



lapse. However an increase of this magnitude over a change 
in redshift of z ~ 1 — > d oes not seem plausible , particularly 
when r esults gathered by Wilman et alJ ll2007h. IChen et al.l 
l|2005l ): IChen fc Mulchaevl Hx!) and lPierleoni et alJ (|2008h 
cite lower values for £ag at redshifts of z < 0.5 and 0. 



Another reason may be because of the galaxy popula- 
tion. RW06 used the HIPASS galaxies that were detected 
using 21 cm emission and targeted in a blind survey. There- 
fore as well as being gas rich it included galaxies of any mass. 
Hence a strong correlation and velocity dispersion along the 
line-of-sight were expected as gas falls through the poten- 
tial of the filamentary structure and is accreted directly by 
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Figure 18. Plots (a) and (b) show the number of DD and RR absorber-galaxy pairs when they are binned along the projected separation. 
The shape of these histograms are a consequence of the edge effects discussed and illustrated in figure [9] The lack of pairs at a = 2.5 
h^" Mpc is compensated in plot (b) by using the perturbed redshift technique. Plot (c) suggests an off-centre peak in the cross-correlation 
at o" — l — 1.5 h^, 1 Mpc. This peak is not significant with an increase of only 1.3<r from a = — 0.5 h^g 1 Mpc. There is no significant 
change in Sag ( ct ) ou t to <r = 4 h^ 1 Mpc. Plots (d) and (e) show the number of DD and RR absorber-galaxy pairs when binned along 
the line of sight. Plot (f) shows the cross-correlation, £ag ("0 m steps of Sw = 2 h^, 1 Mpc. At these levels of significance there is no 
evidence for any correlation between the galaxies and low column density absorbers. What is important is this lack of correlation is 
constant along the line of sight with a variance of less that la as n increases from 0-18 h™ 1 Mpc. 



the low-mas s galaxies. This is t he method of 'cold-accretion' 
proposed in iKeres et all (120051 ) . 

Our magnitude limited galaxies (Mb ~ — 17 — s> —21) 
were collected in an incomplete survey at higher redshifts 
than that of the HIPASS catalogue. Consequently the galax- 
ies for which we were able to get spectroscopic data would 
have a higher ma ss than most of the HIPASS candidates. 
IKeres et alj (|2005l ) claim that the dominant mode of accre- 
tion for galaxies with a dark halo mass M > 10 11 ' 4 M is 
the more conventional method; where gas is shock heated 
to the virial temperature of the halo before cooling and col- 
lapsing onto the galaxy. This could mean less neutral gas 
is expected in the immediate vicinity of the galaxies in our 
sample when compared to those from RW06. 

Anot her reason why th e cross-correlation may seem 
larger in iRvan-Weberl (2006) is because of the smaller bin 
size of Act = Aty = 0.1 h^Mpc that was used. We could 
argue that the absorbers that are at a < 0.1 h^Mpc are 
contained within the galaxy halo and may even be virialised, 
so £ag per bin is likely to substantially increase. 

Alternatively these discrep ancies could b e du e 
to the pro bl em s ug gested by Pierleoni et al.l 1 20081). 



IRvan-Weberl (120061'); 1 Wilman et al.l (|2007l V, IChen et all 
(20051); IChen fc Mulchaevl (|2009) and to a greater extent 
our dataset all either contain few pairs and/or few lines- 
of-sight. The appearance or absence of artifacts such as a 
strong "finger-of-god" in the correlation pattern could be 
due to the small size of the sample. 

Unfortunately because we only had 13 galaxies in the 
catalogue that were dominated by abs orption features we 
wer e unable to test the hyp othesis of IChen et al.l (|2005l ) 
and lChen Sz Mulchaevl (|2009h that £ag is stronger for those 



galaxies that are dominated by emission and comparable 
to the emission dominated galaxy auto-correlation function. 
However we still see a low correlation in the smallest bin, 
despite the abundance of emission dominated galaxies in our 
sample. 

We are hoping to improve on these findings with more 
data collected from three further STIS E230M lines-of-sight 
and galaxies from the VIMOS instrument. This will enable 
us to correlate galaxies and Ly-a absorbers out to a pro- 
jected separation of a ~ 7 hf Mpc at a redshift of z ~ 1. We 
also plan to investigate the 2D 2-point correlation function 
using the Galaxies-Intergalactic Medium Interactio n Calcu- 
lation (GIMIC) SPH simulation (|Crain et al.ll2009l ). 



7 A SUMMARY OF THE RESULTS ON THE 2 
POINT CORRELATION FUNCTION 

We have set the first limits on the 2D 2-point correla- 
tion function between Ly-a absorbers of a column density 
13.2 ^logio (Nhi (cm -2 ))^ 17.4 and galaxies from a magni- 
tude limited survey of 21.5 ^ R ^ 24.5 at z ~ 1, and have 
compared this result to the galaxy auto-correlation. 

When binned over both the projected separation and 
line of sight out to 5 and 20 h^, 1 Mpc respectively with Act = 
1, A-7T = 2 h^Q 1 Mpc, there was no sign at a 3ct level for any 
significant correlation. The peak value at ct = — 1, 7r = 4 — 6 
h^ Mpc gave a 3ct upper-limit of £ag — 2.8. A 5.4ct 
difference was found between this upper-limit and our value 
for the galaxy auto-correlation function. This result is robust 
even when we remove subsets of the galaxy sample that have 
less secure redshifts. 
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Our value for the galaxy auto-correlation, £gg= 10.7 ± 
1.4 at <7 = 0— 1, 7r = — 2 h^Q 1 Mpc was consistent with 
the VVDS-Wide Survey that was taken at the same epoch. 
Both of these surveys show a lower level of correlation than 
that of the 2dFGRS survey at redshifts z ^ 0.2, which is 
consistent with the evolution that is expected. 

We found no difference b etween our results and £ag 
found bv lWilman et all (|2007T l who studied £ag at z ~ 0.5. 
Thus our results are consistent with there being no signifi- 
cant evolution i n £ag with redshift. The much greater value 
of £ag found by iRvan- W eber (2006) at z = that was com- 
parable to £gg could be because the galaxies in her sample 
were gas rich and have a lower mass. 
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